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ABSTRACT 


EXPERIMENTAL RESULTS AND ISSUES ON 
EQUALIZATION FOR NONLINEAR MEMORY CHANNEL 
- PRE-CUSOR ENHANCED RAM-DFE CANCELER 

BY 

LU YUAN 

Master of Science in Electrical Engineering 
New Mexico State University 
Las Cruces, New Mexico, 1998 
Dr. James P. LeBlanc, Chair 

This thesis investigates the effects of the High Power Amplifier (HP A) and 
the filters over a satellite or telemetry channel. The Volterra series expression is 
presented for the nonlinear channel with memory, and the algorithm is based on the 
finite-state machine model. A RAM-based algorithm operating on the receiver side — 
Pre-cursor Enhanced RAM-FSE Canceler (PERC) is developed. A high order 
modulation scheme — 16-QAM is used for simulation, the results show that PERC 
provides an efficient and reliable method to transmit data on the bandlimited 
nonlinear channel. 
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The contribution of PERC algorithm is that it includes both pre-cursors and 
post-cursors as the RAM address lines, and suggests a new way to make decision on 
the pre-addresses. Compared with the RAM-DFE structure that only includes post- 
addresses, the BER versus Eb/NO performance of PERC is substantially enhanced. 
Experiments are performed for PERC algorithms with different parameters on 
AWGN channels, and the results are compared and analyzed. 

The investigation of this thesis includes software simulation and hardware 
verification. Hardware is setup to collect actual TWT data. Simulation on both the 
software-generated data and the real-world data are performed. Practical limitations 
are considered for the hardware collected data. Simulation results verified the 
reliability of the PERC algorithm. 

This work was conducted at NMSU in the Center for Space Telemetering and 
Telecommunications Systems in the Klipsch School of Electrical and Computer 
Engineering Department and is supported by grant AX-6425 from Sandia National 


Labs. 
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CHAPTER 1 INTRODUCTION AND BACKGROUND 


1.1 Problem description 

With the rapid development of satellite communications, the frequency band 
is becoming more and more congested, the need to transmit more data through the 
bandwidth limited channel is becoming more and more compelling. To meet these 
needs, it is needed to either transmit data at a faster symbol rate or design higher 
order modulation schemes, which are more bandwidth efficient, such as Quadrature 
Amplitude Modulation (QAM). However, increasing symbol rate will also increase 
the Inter Symbol Interference (ISI), and the QAM scheme faces decoding difficulty 
because of the High Power Amplifier (HP A) involved in the satellite channel. To 
achieve power efficiency, the HPA works in its saturation region, which brings 
nonlinearity to the system. The QAM constellation will be distorted both in 
amplitude and in phase. Through bandlimited channels, the received data will be a 
warped constellation of clusters. This research is focused on a nonlinear equalizer — 
Pre-cursor Enhanced RAM-DFE Canceler (PERC), which provides a method to 
successfully use the higher-order modulation scheme through nonlinear memory 
channel. 
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1.2 Review of previous work 

The methods proposed for transmission through nonlinear memory channel 
can be divided into two categories: operation on the transmitter and operation on the 
receiver. 

The basic idea of the transmitter-based techniques is pre-distortion, it 
performs an “inverse” nonlinear transform before the HPA such that the nonlinearity 
caused by HPA can be canceled. Both continuous-waveform predistortion which 
consists of an analog device [1] and data predistortion before modulator [2] [3] [4] 
have been proposed. The continuous-waveform predistorters are more complex and 
expensive, less flexible, and their performance is not as good as their digital 
counterparts [2], Memory less predistortion and predistortion with memory techniques 
are developed for digital data predistortion. Saleh and Salz [5] proposed a pioneering 
paper about memoryless predistortion, the compensator pre-warps the source 
constellation so that after the nonlinear transformation of HPA, the desired 
constellation can be achieved. However, in bandlimited cases, the clustering due to 
the nonlinear ISI of the channel can not be removed by such a memory less device. A 
predistortion with memory scheme is initially presented by E. Biglieri, et al, in [2], 
based on the theory of p th - order inverse systems. A. Bemardini and S. De Fina [3] 
developed the idea by setting up the structure of two linear blocks with memory 
separated by a memoryless nonlinearity device. G. Lazzarin [4] further simplified the 
structure, the predistortion part is composed only of digital filters and memoryless 
nonlinear devices that works at the symbol rate. The predistortion with memory 
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schemes above are all based on the Volterra Model of the system. The transmitter- 
based techniques require a local receiver at the transmitter, and a feedback path to 
adaptively adjust the predistortion parameters, which increases the complexity and 
power consumption of the transmitter. 

Schemes focused on the receiver side are proposed herein attempting to 
minimize the effects of the nonlinearity, ISI and noise. M. F. Meysiya, et al. [6] 
proposed the maximum likelihood sequence estimation (MLSE) scheme, which is 
based on Viterbi Algorithm, however, because of its complexity, it is not practical for 
implementation of very high data rates. Nonlinear equalizer schemes based on 
Volterra Model are proposed in [7] by D. D. Falconer and in [8] by S. Benedetto. In 
[9], an FSE-Volterra combined equalizer is suggested, which has a better 
performance. The Volterra equalizers act as an “inverse” of the nonlinear memory 
channel, but the noise in the channel is also amplified and distorted [10], To prevent 
the excessive noise enhancement, equalizers that involve a random access memory 
(RAM) as a look-up table are developed. In these schemes, the local decisions on the 
received symbols are used as the address lines to access the RAM, and the content of 
the RAM compensates for the nonlinear memory channel. By adding the 
compensation part to the incoming distorted symbols, the desired symbols can be 
recovered. Different approaches are made to decide the address lines of the RAM. 
The RAM-DFE was proposed by K. Fisher, et al. In [11], which employs the 
Decision feedback structure, the past decisions are used as the address lines (post- 
addresses only) to access the RAM. However, often in the channel with ISI, both 
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previous and future symbols are interfering the current symbol, so a better approach is 
to also involve current and future symbols as address lines, in other words, the RAM 
address lines involve both post-addresses and pre-addresses. RAM-canceler schemes 
are proposed by E. Bigieri, et al. in [12] for voiceband data transmission, and O. 
Agazzi and N. Seshadri in [13] proposed schemes for magnetic recording channels, 
tentative symbol decisions are made on the current and future symbols that serve as 
pre-addresses. However, the inaccuracy of the tentative decision limits its BER 
performance. To make more reliable decision on current and future symbols, PERC 
algorithm is proposed. 

1.3 Scheme description and overview of chapters 

The PERC algorithm is first suggested by J. LeBlanc et al, in [10], it is a 
RAM-based algorithm operating on the receiver side. It provides a more reliable way 
to make decisions on the pre-addresses by testing over all the possible pre-address 
combinations and picking the best combination to make local decision on the current 
symbol. A Fractionally Spaced Equalizer (FSE) is involved before the RAM 
structure for synchronizing considerations. 

It will be convenient to study on the lowpass equivalent discrete time channel 
model of the bandpass satellite system channel. Chapter 1 presents the lowpass 
equivalent model of the system, and also describes the models used in simulation for 
each part of the channel. The research work includes two parts, software simulation 
and hardware verification. The algorithm is tested by the software simulation first, 
followed by software implementation of PERC on the real data collected by 


4 



hardware. Chapter 3 presents a description about the algorithm, software simulation is 
presented in Chapter 4. Chapter 5 gives the outline of hardware setup and real-data 
implementation results, practical factors are considered for real-data. Chapter 6 gives 
conclusions and suggestions for future work on this topic. A detailed description 
about the setup and configuration of the each equipment in the hardware is provided 
in Appendix A. The power input-output curve for the TWTA and SSPA used in the 
hardware is measured and presented in Appendix B. The characteristics of the BLP- 
1.9 filter used in the hardware setup are provided in Appendix C. Finally the source 
codes in MATLAB for the simulation is given in Appendix C. 
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CHAPTER 2 THE CHANNEL MODEL 


The channel of study is chosen to be representative of a satellite or telemetry 
channel which uses a nonlinear amplifier. The communication system is a bandpass 
system, and Traveling Wave Tube Amplifier working at S-band (1550-5200MHz). It 
will be convenient to study on its low-pass equivalent model [14]. The low-pass 
equivalent model of the channel can be considered as: 


noise 


s(k) 



Figure 2-1 Low-pass equivalent model of the channel 

Other parts of the system that are not included in this model, such as the 
modulator, the demodulator and so on, are considered to be ideal. s(k) is the digital 
symbol sequence after modulation, the D/C converter is assumed to be ideal, the 
pulse shaping is a rectangular function of unit amplitude over a time period of length 
T (symbol interval of s(k)). A Traveling Wave Tube Amplifier (TWTA) is contained 
in the bandlimited Additive White Gaussian Noise (AWGN) channel. Filter I is a 
lowpass pre-filter to limit the TWTA input noise band, filter II is a lowpass post-filter 
to limit the spectral emissions of the output of TWTA into other frequency bands. 
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2.1 16-QAM 


Because of the nonlinear effect of the TWT amplifier, a typical modulation 
format for communications is M phase shift keying (M-PSK). However, Sixteen State 
Quadrature Amplitude Modulation (16-QAM) has recently been considered for 
communications. The modulation scheme studied in this thesis is 16-QAM. 

The functional block diagram of QAM modulator is: 



Figure 2-2 16-QAM modulation block diagram 

The transmitted signal waveform is [15]: 

u m (t) = A mc gT(i t)cos(2 7rf c t) + A ms gr(t)sin(27rf c t), m=l,2, .... M (2.1) 
For 16-QAM, M=4, The signal can be represented in the form of a two- 
dimensional vector s m =[A mc A ms J. The following figure illustrates the state mapping 
and serial input timing. 
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Figure 2-3 16-QAM constellation and timing 

Four bits are mapped into one of sixteen I-Q vector states, and the four bits are 
Gray coded. I is the in phase component, and Q is the quadrature component. 

2.2 Filter Model 

In the simulation, both the pre-filter and the post-filter are modeled as 6th- 
order low-pass Butterworth filters with cutoff frequency equals to the symbol rate of 
s(k). The magnitude response of Butterworth lowpass filters is maximally flat in the 
passband, it is an all-pole filter. For an Nth-order lowpass filter, the first ( 2N-1 ) 
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derivatives of the magnitude-squared function are zeroes at 0=0. The magnitude- 
squared function decreases monotonically with O in the passband and the stopband. 
The magnitude-squared function for a continuous-time Butterworth lowpass filter is 
of the form: 


mm 2 = 


i 

i +(jQ/jQ c r 


( 2 . 2 ) 


The squared function is sketched as Figure 2-4: 

Butterworth filter, N=2, 4,6,8 


\Hc0n>\ 2 



Figure 2-4 Butterworth filter frequency response 
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As the filter order N increases, the filter characteristic curve becomes sharper, 
the roll off rate is about -20NdB/decade. For all N, the magnitude-squared function 
equals to unity at £2=0, at the cutoff frequency £2 C the magnitude squared function 
equals to 1/2. The impulse response of <5 -order Butterworth filter is: 



Time (multiples of T=1/fc) 

Figure 2-5 Butterworth filter impulse response 


2.3 TWTA model 

The High Power Amplifier (HP A) used in the satellite communication is 
usually a Traveling Wave Tube Amplifier (TWTA). The TWTA is modeled as a 
nonlinear memoryless frequency-independent amplifier, it has the effect of nonlinear 
distortion in both the amplitude and the phase [16]. 
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Assume the normalized baseband input to the TWTA is x = pxP e , where p is 
the amplitude and <9 is the phase of the baseband input signal. The TWTA will exhibit 
AM/AM and AM/PM conversion effects. The normalized baseband output of the 
TWTA will be y = AxP^ J0 , according to the model due to Saleh [16], the amplitude 
A and phase shift <t> is given by: 


A(p)= 


OaP 

1 + /? a P 2 


(AM/AM conversion) 


(2.3) 


<*>(p)= 


1 +/V 2 


(AM/PM conversion) 


(2.4) 


For very large p, A(p) approaches ajp a p, and 0(p) approaches constant 
In the simulation, the parameters are set as the following due to Saleh [16]: 
a a = 1 .9638 a </, = 2.5293 

p a = 0.9945 p 4 = 2.8168 

The plot of the amplitude and phase conversion with the parameters above is 
shown in Figure 2-6. The input amplitude is normalized so that the saturation point is 
at unity. In applications, the TWTA is driven around saturation to achieve high power 
efficiency. 
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Figure 2-6 TWTA input-output curve 

For power efficiency considerations, the 16-QAM constellation is chosen such 
that the average amplitude is 1, which drives the TWTA at the saturation point. The 
channel modeled as Figure 2-1 will have two effects on the QAM constellations: 
warping, which is due to the nonlinearity, and clustering, which is due to the ISI 
introduced by the filtering [17]. At the receiver side, the signal constellation is not on 
rectangular grid as 1 6-Q AM, the points at outer comer are compressed, so the outer 
constellation forms a circle. Due to the Intersymbol interference (ISI) caused by 
filtering, each constellation point becomes a cluster, the plot in Figure 2-7 is an 
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illustration of the effect of warping and clustering of this channel in noise free case for 
16-QAM. 



Figure 2-7 the effect of warping and clustering of the channel 
2.4 Volterra Model 

Note that although the source of nonlinearity (TWTA) is modeled as 
memoryless, the effect of the filters in the channel model will cause ISI, and will 
make the overall channel a nonlinear system with memory. The Volterra model 
approach provides a useful description on the nonlinear channel with memory. 
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Assume the transmitted signal is j*, the continuous baseband waveform after 
pulse shaping is 

sith^sMt-kT,) ( 2 . 5 ) 

k 

Where p is the pulse shaping function. After passing the pre-filter which has 
an impulse response of h/(t), it becomes 



(s * h, )(t) = Y J s dP*hP(t-kT s ) = Y j s k h,,- k 

k k 

(2.6) 

where 

h\,k —(p*hi)(t) 

(2.7) 

and 

h\,n-k = (p*hi)(t-kTs) 

(2.8) 


The signal in equation (2.6) is the input waveform to the TWTA. Consider the 
gain function of the TWTA in equation (2.3), its Taylor series expansion can be 
expressed as: 

A(p) = a a p-a a p a p 3 +a a [i a 2 p 5 -a a fi a 3 p 7 +... (2.9) 

so the amplitude of TWTA output is 

A(p) = A[(s*h ] )(i)] = a a Y J S k h x „_ k -a a fi a Y i s , s J s t hln -' h '-"- k 

k i,j,k 

+ <* a Pa Z S i S j S k S K S m h\,n-i h\, n -j h\ t n-k h\,„- m (2.10) 

Uj>kJ 9 m 

TWTA output A(p) is then passed through the post-filter which has an impulse 
response of h 2 (t), and the output is: 
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where 


S(f) = A(p) * h 2 {t) = 4[(s * /.,)«)] * h 2 (t) = X s.hll + X +... 

* i.j.k 

( 2 . 11 ) 

h Z = « a ft V-* * h 2 , h™ n _ ]n _ k = -a a /? a /i i.n-i hm-j h ln -> *h 2 (2.12) 

and so on [10]. 


Sampling s(f) at«r 5 to get the discrete time outputs,, yields: 


‘-I 


s.h 


( 1 ) 


+ 


i.j.k 


s ,s l s k h { n 3 ] n . 


■2 




(1) 


+ 


(3) 


, S n-i S n-j S n-*hijk 


i.j.k 


(2.13) 


The terms h n . in _ ]n J 3> 2 ... are called the Volterra kernels, the 

characteristics of the system is decided by the Volterra kernels [2], The stronger the 
nonlinearity is, the more Volterra kernels are needed to adequately characterize the 
system, and the length of the channel memory will decide how many terms are needed 
in the summation over index i, j, k, and so on [8]. 

There exist both linear ISI (first order term) and nonlinear ISI (third and higher 
order terms) in this model. Only odd-order kernels appear, this is due to the nature of 
the source of the nonlinearity — TWTA. 

Similarly, the phase function of the TWTA also has a Volterra model, 
although we only described the amplitude here. 


Equation (2.13) suggests that the output is a nonlinear function of the input 
discrete time signal Sk and its neighboring symbols. Compared with voiceband 
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channels, satellite channels may have stronger nonlinearity but shorter memory [8], so 
the function will only involve a few terms centered around k : 

Sk — f (S k+(v1) S k , ■-•S k _ m ) (2.14) 

where n and m are decided by the memory length of the channel. 

This can be considered as a mapping from some neighboring input signals to 
an output signal, the mapping is characterized by certain Volterra kernels. 
Recognizing that the source symbols are drawn from a finite alphabet, we see that the 
mapping can be implemented by a look-up table — random access memory (RAM), 
with the input signals as addresses, and the corresponding output as the data content 
accessed by that address. This leads to the Finite-State Machine (FSM) model of the 
channel as described below: 



Figure 2-8 FSM model of the channel 

The channel is characterized by a deterministic RAM and a noise adder, the 
current outgoing signal s k and its neighboring signals decide the state of the model, r k 
is the received signal. Additive white Gaussian noise is included. 
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CHAPTER 3 PERC ALGORITHM 


3. 1 Scheme outline 

The model described in Figure 2-1 is applied in the simulation. A Pre-cursor 
Enhanced RAM-DFE Canceler (PERC) structure is used at the receiver side to 
compensate for the nonlinearity and ISI of the channel. As stated at the end of 
Chapter 1, the received signal is modeled: 

r k —Sk+V k — f (s k+n1 , ..., s k , ••■s k _ m )+v k (3.1) 

where v* is the additive white Gaussian noise of the channel. 

Define the difference between the desired signal (the signal sent) and the 
received signal as g\ 

§(Sk+n-L ■■■> •••) Sk-rrJ Sk~(f(Sk+n-h •••> Sh ■■■> ^k-nx) (3.2) 

The receiver needs to compensate for g(.) to recover the desired signal. Notice that, 
as f(), g() is also a function of $*+„./, ..., s*, .... Sk- m , it can be implemented by a RAM 
look-up table at the receiver side. For synchronization considerations, the RAM 
structure is used in conjunction with a Fractionally Spaced Equalizer (FSE). The FSE 
helps to fix the delay of the channel and removes linear ISI. In the real-data 
simulation in Chapter 5, we will see that the FSE will also cancel the rotation effect 
on the demodulated signal caused by the delay in hardware. The idea of adaptive 
filtering and FSE will be introduced later in this chapter. The overall structure of 
receiver is suggested in Figure 3-1 : 
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A 



Figure 3-1 structure of the receiver 

FSE is the fractionally spaced, linear feed-forward equalizer, and Q is the 
decision device. The RAM table implements the mapping defined by the function 
g(.), with address lines Sk+n-i, ■■■, Sk-m ■ $*+«-/> ■■■■, $k are called pre-addresses, and Sk-i, 
Sk-m are called post-addresses. 

The function f() and g(), are decided by the characteristics of the channel, to 
implement the mapping g(), thus the characteristics of the channel need to be known. 
This is achieved by sending training sequence. The idea of training is to send a 
sequence that is known by the receiver, then by comparing what the receiver receives 
and what is actually sent, the receiver can get knowledge about the characteristics of 
the channel. When the channel is relatively stable, and we can assume it will not 
change before the next time we “train” it. After training, the parameters of the FSE 
and the RAM are fixed, and the receiver works at a fixed mode for data transmission. 
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However, when the receiver is working in the fixed mode for data 
transmission, the pre-addresses s*, ..., St+n-i are not known, we will focus on how to 
get the correct local decision of these pre-addresses later in section 3.4. The taps of 
the FSE are set adaptively, the algorithm used is Normalized Least Mean Square 
(NLMS) algorithm. We will discuss adaptive filtering issues next. 

3.2 Adaptive filtering 

The adaptive filter is a self-designing device, which operates in a recursive 
fashion, and “makes it possible for the filter to perform satisfactorily in an 
environment where complete knowledge of the relevant signal characteristics is not 
available.” [18] It starts with some initial conditions, following a certain algorithm, 
after some iterations, converges to the optimum Wiener solution. 

One of the applications of adaptive filtering is “inverse modeling”. The 
adaptive filter works as an equalizer, it acts as the best fit (in some sense) of the 
“inverse” of the channel. Ideally, the overall effect of the channel and the equalizer is 
a pure delay. The diagram of the inverse modeling is shown in Figure 3-2: 



Figure 3-2 adaptive inverse modeling 
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u(n) is the received signal, y(n) is the output of the equalizer, , let w=[wo ... 
wm_i ] be the filter coefficient vector, u(n)~ [u(n) ... u(n-M+l)] be the input vector, 
so the output is: 

y(n)-w H u(n) (3.3) 

The difference between the desired signal d(n) and y(n) is used to adjust the 
coefficients of the filter. The adjusting algorithm is Normalized Lease Mean Square 
(NLMS) algorithm, which is a normalized version of Lease Mean Square (LMS) 
algorithm. 

3.3 LMS algorithm and NLMS algorithm 

3.3.1 Wiener Filter 

The output of the equalizer y(n) provides an estimate of the desired signal 
d(n). The difference between them is: 

e(n)=d(n)-y(n) (3.4) 

The goal of adjusting the coefficients of the filter is to minimize the mean- 
square error (MSE). The MSE J is defined as: 

J=E[e(n)e(n)*J=E[\e(n)\ 2 ] (3.5) 

where denotes for the expectation. 

J=E[e(n)e(n)*] 

=E[(d(n)-y(n))(d(n)-y(n)) *] 

- o/ -w H p-p H w + w h R w (3.6) 
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2 9 9 T 

where <7d is the variance of the desired signal, w=[wo ... wm-iJ is the filter 
coefficient vector, and u(n)= [u(n) ... u(n-M+ 1)] T is the input vector, p=E[u(n)d*(n)] 
is the Mxl cross correlation vector between the input vector u(n) and the desired 
signal d(n), and R=E[u(n)u H (n) ] is the MxM correlation matrix of the input vector. 


It can be proved [18] that J will be minimized when 



w=R' 1 p= w a (optimal Wiener Solution) 

(3.7) 

and 

Jmin~^d ~ P R P- 

(3.8) 


As an example, Figure 3-3 shows the error surface of a two-tap ( w=[w 0 wjJ) 
filter, mean-squared error J versus tap w 0 and w t . The bottom of the surface is J min , 
the corresponding tap vector w 0 is the Wiener solution. 



Figure 3-3 error surface 
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3.3.2 Steepest Descent Algorithm 

When M (the order of the filter) is large, it is not convenient to compute the 
inverse of R. The steepest descent algorithm is an algorithm that will search the error 
performance surface J for its minimum point J min , and the corresponding filter 
coefficient vector will be the Wiener solution w 0 . 

The steps of the Steepest Descent Algorithm are: 


Initialize w(0) 

For n=0 to L-l 

compute VJ (gradient vector) 
w(n+I)= w(n)+p’[- VJ] 
end; 


where VJ=-2E[u(n)e*(n) ] (3.9) 

The update equation for steepest descent algorithm is 

w(n+l)= w(n)+ju’2E[u(n)e*(n)] 

= w(n)+juE[u(n)e*(n)J 

= w(n)+ju[p-Rw(n)] (3.10) 

To be stable, the stepsize should satisfy 0<ju<2/H max , where X max is the largest 
eigenvalue of the correlation matrix R. 
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3.3.3 LMS algorithm 


The shortcoming of computing the Wiener solution and the steepest descent 
algorithm is that they both require knowledge of R and p , which are usually unknown. 
LMS algorithm uses the instantaneous values as estimates of R and p. 


R = u(n)u H (n) 

(3-11) 

p = u(n)d*(n) 

(3.12) 

The update equation for LMS is: 


w(n+l) = w (n)+jiu(n)e*(n) 

(3.13) 


The steps for LMS algorithm are: 

Initialize w (0) 

for n=0 to L-l 

e(n)=d(n)- w H (n)u(n) 
w (n+I)= w (n)+juu(n)e*(n) 

end; 


If we choose 0<ju<2/A max , LMS algorithm converges in mean, 
lim E[ w (n)J=w 0 if we choose 0<ju<2/tr(R), “tr” denotes the trace of a matrix, LMS 
converges in mean square, we have 

J(n)=J mm +J e fn) (3.14) 

and \vmJ ex = constant. 
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3.3.4 NLMS 


* * + 

In the LMS algorithm above, we have w (n+l)= w ( h)+juu(n)e (n), the 
correction term /uu(n)e (n) is directly proportional to u(n), which means the 
correction is sensitive to the power of the input signals. This will affect the 
convergence rate especially when the power range of input signal is large. To make 
the convergence rate independent of the input signal power, we normalize the 
correction by the input signal power, so the update equation becomes 

w (n+l)= w (n)+ Mu(n)e*(n)/\\u(n)\\ 2 (3.15) 

where \\u(n)\\ 2 = u T (n)u(n). We can think of NLMS algorithm as a special case of 

LMS algorithm which has a time-varying stepsize 

iu(n)=»/\\u(n)\\ 2 (3.16) 

To avoid numerical error problems when \\u(n)\\ 2 is small, a small constant a is added 
to the denominator, the equation becomes: 

w (n+l)= w ( n)+ fiu(n)e*(n)/(a+\\u(n)\\ 2 ) (3.17) 

Typically, a is chosen to be 0.01 . 

To achieve convergence in the mean square, it is required that 0< n<2, NLMS 
algorithm converges faster than LMS algorithm especially when the data is correlated 
(large eigen spread) for same level of steady-state misadjustment. 
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3.4 Fractionally Spaced Equalizer 


3.4.1 FSE theory 


A typical communication system can be modeled as: 



Figure 3-4 communication system model 

Where s(k) is the source sequence with symbol interval T, assume the D/C converter 
is ideal: 


m=%S(t-t cT)s(fc) 

/=-<* 


(3.18) 


and the received waveform is 


r( 0 = l>(/)c(f - ,T) 

r=-Qc 

r(t) is sampled at rate f s -L/T, 


(3.19) 


r(kT / L) = £ s(i)c(kT I L - IT) 

/ = -00 ^ ' ' 

A sequence is called baud spaced when the interval between symbols is T 
units of time. In the above model, s(k) is baud spaced, when L= 1, the received 
waveform r(t) is sampled at the same rate as the symbol rate of the baud spaced 
source sequence s(k), the equalization is called baud spaced equalization. When L is 
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an integer larger than 1, the equalization above is called fractionally spaced 
equalization. The channel and equalizer taps are T/L spaced. 

The system can be viewed as a multi-channel model, i.e., there are L parallel 
sub-channels, the delay between adjacent sub-channels is T/L. The I th subchannel is: 


c,(k) = c(kT+!L) 


(3.21) 


The signal after sampling the output of the I th subchannel is: 
r(T(n +-£)) = £ s(i)c((n - i)T ) + lj) = £ s(/)c,(n - /) (3 22) 

A common choice for L is 2, when L-2: 


(3.23) 

Denote the even and odd subchannels by c even and c odd , the even and odd samples of 
the received signal are: 


00 00 

rr = H»T) = X S(i)c((n - i)T) = £ (3 24) 

/=- oo /=— oo 


'-r = rinT~) = !>(*>((« - Or - f ) = fXiV" (3 25) 

" /=— oo " /=— oo 

Suppose the length of the equalizer is Z/, and denote the taps equalizer by /, 
the output of the equalizer is: 


y(4>=i;V(/i)r((/c-/)i) 

^ /=n ^ ^ 


(3.26) 
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It is decimated by 2 to get the baud spaced output, without loss of generality, retain 
even samples: 

4-' T T (4-U/2 7 - 7 - 

y(nT) = j - /-) = £[f<,T)r(( n - OT) + tyT + ±)r((n - i)T - -i-)] 

]=o Z. Z /=o Z Z 

(3.27) 

Equation (3.27) suggests that the equalizer can also be viewed as being 
composed of even and odd subequalizers, denote them by f even and f odd , we have: 

(£/-!)/ 2 

V / seven ^even , sodd „odd \ 

y\ n n= r +f r ) (3.28) 

(=0 

The output y(nT) will be the sum of the output of even and odd sub-equalizers. The 
above equations suggest that the following structures are equivalent: 
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s{k / 2), keven 

In model (*), s’(k)=\ q ,kodd (3.29) 


3.4.2 FSE training 

For BSE equalizers, when the channel is Finite Impulse Response (FIR) filter 
with length L c , the length of the equalizer needs to be several (usually 3-5) times of 
L c . This is not necessary for FSE, the length of the equalizer Z/ only need to satisfy 
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Lf>L c -l. Lf is the sum of the length of the even sub-equalizer Me and the odd sub- 
equalizer Mo, Me = f(Lc-l)/2 7 Mo=i(Lc-l)/2j fx 7rounds the elements of x to the 
nearest integers towards infinity, /xV rounds the elements of x to the nearest integers 
towards minus infinity [19]. 

The FSE is trained to set the coefficients of the filter, the training process of 
FSE is described in the diagram below. 



Figure 3-6 FSE training diagram 


The FSE is adapted via NLMS algorithm, and the feedback error signal used 
to adjust the coefficients is 

e(k) =d(k) -y(k) =d(k)-(y e (k) +y 0 (k)) (3.30) 

The taps updating process stops when the MSE does not decrease any more. 
Experiments are needed to be done to decide the proper delay, the scheme is 
to test over possible delay values and check the MSE at the end of the adapting 
process. The delay corresponding to the minimum MSE is finally set to be the correct 
delay. When the number of possible delays is big, to avoid testing all delays, 
typically the delay may be set to somewhere in the middle of the FSE, even though 
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this may be non-optimal. The length of the filter is chosen by a trail and error way, 
i.e., starts from a short equalizer length, then increase the length and check the MSE, 
the equalizer length is chosen such that if continue to increase the length, the MSE 
will not decrease. 

The advantages of FSE are: compared with BSE, which has one tap per 
symbol, FSE ( L=2 ) has two taps per symbol, and the channel is also sampled twice 
the sampling rate of BSE, this helps to avoid aliasing. Also, by doubling the sampling 
rate, FSE is less sensitive to the sampling clock phase. Another advantage is that the 
length of FSE taps can be shorter than BSE, and it works much better than BSE when 
the channel has zeros close to the unit circle. After FSE adapting process, the taps are 
fixed, it will be working in a fixed mode, and the training process for the RAM 
begins. 

3.5 RAM structure for PERC 

The RAM will compensate for the difference between the output of FSE y\ 
and the desired signal Sk, which is a delayed version of the transmitted 16-QAM 
symbols. The content of the RAM is setup by training, i.e., comparing the output of 
FSE and the desired sequence, the delay of the desired sequence has been decided by 
the FSE experiment mentioned earlier (the delay with minimum MSE). The diagram 
for RAM training is: 
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Figure 3-7 RAM training diagram 

The symbols s k + n +i> .... s k +i, Sk ..., s k . m form the address lines. Denote the 
address vector by A=[ s k + n +i, ■■■, s k +i, Sk ■■■, s k - m J Denote the data content accessed by 
the address “ A ” by RAM_data(A), which compensates for the nonlinearity and 
nonlinear ISI. Assume a symbol s k is only interfered by its neighboring symbols 
Sk+n+i, ■■■, Sk+i, Sk-i, .... Sk-m (with yi, m be finite integers), when the channel is time- 
invariant and noise free, after the FSE taps are fixed, the FSE output y k is uniquely 
decided by S k + n +j, ■■■> S k +]Sk ■■■> ^k-mi y k ~y(Sk^n+ 1 ’ •••, Sk+l, Sk Sk-m)i let 

RAM_data(A)=s k -y k = Sk-y(s k+n+ i, ■■■, s k+1: s h .... s k .J (3.31) 
refer to the structure in Figure 3-1, the signal before the decision device is: 

z k =y k +RAM_data(A) =y k +s k -y k =s k (3.32) 
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The symbol Sk is perfectly recovered. There are 16 n+m possible values for the 
address “A”, which also means a RAM with size 16 n+m is needed to store these data. 
As the compensation is uniquely decided by the n+m address lines, to fill up the 
RAM, we only need to generate all the 16 n+m possible address combinations, and 
calculate the data content. 

However, two factors need to be taken into consideration. First, in a real 
channel, y* is not only affected by Sk+ n +i, ..., Sk+i, Sk, Sk- m , symbols out of this range 
will also interfere Sk (with a less scale). Second, there exists additive noise in the 
channel. Due to them, with same Sk+„+i, .... Sk+i, Sk, .... Sk-m, the corresponding y>k and 
RAM_data(A)=Sk-yk may not be unique, they form clusters. A reasonable way to get 
the best data content for address “. A ” is to average many “sk-yk ” s and take the centroid 
of the cluster. 

Two RAMs with the same size 16 m+n and same address lines are involved, 
they are both set to 0 initially. One works as a dataJRAM, the other works as a 
counter RAM which record how many times each address has been accessed. Each 
time a certain address “A” is accessed, the corresponding “ Sk-yk ” is accumulated to the 
data_RAM: 

RAM_data(A)=RAM_data(A)+Sk-yk (3.33) 

At the same time, the content in address A in the counter_RAM is increased 
by 1 . When the training process finishes, divide data_RAM by counter_RAM in an 
address-by-address manner to get the average, and put the result back to the 
data RAM. 
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After the look-up table is setup, the FSE coefficients, and RAM contents are 
both fixed, and the system works in a fixed-mode for data transmission. The diagram 
for fixed-mode is: 



Figure 3-8 PERC block diagram 

In data transmission, the receiver will decide the address lines by making local 

A 

decisions on the received symbol sequence. At time k, previous local decisions S*-i , 

A 

..., Sk-m are used as post-addresses, the pre-address S* , s* +n -i are decided by 
testing over all possible pre-address combinations, and pick the best combination, 
then the corresponding testing pre-address for current symbol s* is decided as the 

local decision S k for the current symbol. 
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The process to select the best pre-address combination is following: After 
fixing the post-addresses by using previous local decision, there will be 16" possible 
addresses “ A ” because of the 16 n possible pre-addresses combinations. The 
corresponding RAM contents causes z* ~yk + RA M_data(A ) having up to 16 n different 
values, choose the z* that is closest to the 16-QAM constellation grid point that is 

assumed for pre-address s k as the best one, i.e., calculate the distance function 

distrW 0 - s„ V) | (3.34) 

Where /=/ to 16 n , s k 0) is the current symbol cursor assumed by the i-th pre- 
address test, and z™ is the sum of yu and the RAM content accessed by the i-th pre- 
address test. The pre-address combination with the smallest dist, is chosen to be the 

best pre-address combination, and the corresponding address line for s* is decided as 
the current symbol. Mathematically: 

s* = arg(min|z/° — s k {i) \) (3.35) 

The performance of the algorithm will largely depend on the choice of n and 
m, how many neighboring symbols we are using will decide how completely we are 
modeling the channel. The disadvantage of this scheme is that the size RAM may be 
very large, and to increase n or m by 1, the RAM size will be 16 times larger. 
Experiments are done for the proper values of n and m. Denote the algorithm with n 
pre-addresses and m post-addresses by PERC(n,w). 
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CHAPTER 4 SIMULATION RESULTS 


The simulation results of PERC algorithm are shown in this chapter. 
The simulation programs are written in MATLAB. The simulation process 
includes the training mode and the fixed-mode. The training mode sets the taps 
of FSE and the content in RAM, and the fixed-mode works for data 
transmission. 

The 16-QAM constellation is normalized so that the average amplitude 
of the constellation points is 1. A random 16-QAM symbol sequence is 
generated as the transmitted symbols. The 6-th order Butterworth analog filter 
is implemented by its digital approximation via bilinear transformation. And the 
digital filter is normalized so that the passband gain is 1, this guarantees the 
input to the TWTA has an average amplitude of 1, which will drive TWTA in 
saturation region. The received signal is the sum of the post-filter output symbol 
and the discrete time Additive White Guassian Noise (AWGN) component. For 
16-QAM, the energy of one symbol is 4 times the energy of one bit, so the 
symbol-energy-noise-spectral-density-ratio ( Es/NO ) is four times the bit-energy- 
noise-spectral-density-ratio ( Eb/NO ), i.e., Es/N0=4Eb/N0. 

The data at different stages of the receiver will be shown in this chapter, 
for Eb/NO l4dB case. The received data is shown in Figure 4-1 (10000 data 
points included): 
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Figure 4-1 the received data 

The training process for FSE is performed first, the input to FSE is at 
twice the symbol rate and the output of FSE is at the symbol rate. The training 
finishes when the MSE comes to the floor, Figure 4-2 shows the learning curve 
of the training process: 
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number of iterations, n 


Figure 4-2 learning curve of FSE 

After FSE training, the FSE taps are fixed, the training process for RAM 
described in Figure 3-8 begins. To do the averaging, we need to access each 
address of the RAM many times. For comparison, each address will be accessed 
15 times in average for all PERC algorithms, i.e., for PERC(«,m), the training 
sequence for RAM has the length of 15xl6 m+n . 

When the training period ends, the system runs at a fixed mode, Figure 
4-3 shows the output data of the FSE. 
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Figure 4-3 data after FSE 

Since the FSE is a linear equalizer, it will remove (or partially remove) 
the linear ISI in the channel, the variance of the clusters becomes smaller. 
However, the center of the clusters are still not at the 16-QAM rectangular 
constellation grids, the nonlinear transformation and nonlinear ISI caused by the 
power amplifier and the filters are not removed. They will need to be further 
compensated by the look-up table -- RAM. The data after RAM compensation 
(also the input to the decision device) for PERC schemes with different n, m are 
compared in Figure 4-4 (1M data points included): 
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PERC(0,2), which does not involve pre-address, fails to provide 
acceptable improvement in canceling the nonlinearity. By involving one pre- 
address, PERC(7,2) cancels the nonlinearity and recovers the 16-QAM 
constellation, the variance of the clustering also becomes smaller (refer to the 
MSE of the data points). PERC(2,2) includes two pre-address lines, it removes 
the clustering effect more. That the variance of the cluster (MSE of the data) is 
small does not necessarily mean that the error rate is small, because the RAM 
compensator may have moved an incoming symbol closer to an wrong 
constellation grid, the BER versus Eb/NO performance needs to be studied on, 
and the results are shown in Figure 4-5 for PERC algorithm with different n and 
m. The theoretical AWGN channel BER is also presented for reference. 




Figure 4-5 BER vs. Eb/NO 

If a decision error happens, we assume that the wrong decided point is 
among the closest neighboring point of the correct one. Assume the 16-QAM is 
Gray coded, which means there is only 1 bit difference between the closest 
neighboring points on the constellation. Based on the two assumptions above, if 
a symbol is wrong decided, we can assume that only 1 bit is wrong, so the 
symbol error-rate is four times the bit symbol rate for 16-QAM. The BER in the 
above figure is obtained by dividing the symbol error rate by 4. 

PERC(0,2) fails to remove the nonlinearity, and its BER are at the level 
around 5% for all Eb/NO cases, this is because the nonlinearity is the dominating 
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factor for BER compared with the noise. For PERC(7,7), PERC(7,2) and 
PERC(7,5), as the post-address lines increase, the BER performance is 
improved. The calculation in fixed mode for them are the same, and they test 
over 1 6 possibilities to make decision on one symbol, the cost is that the RAM 
size is increased. PERC(2,2) has two pre-address lines, the RAM size is the 
same as the PERC(7,J), and PERC(2,2) needs to test over 16 * 16=256 possible 
cases on one symbol decision. From the simulation for this channel, its 
performance is similar to that of PERC(7,2). In general, increasing pre-address 
by one will cause the calculation in fixed mode to increase 16 times, but the 
performance does not improve much, so it is not efficient to include more than 1 
pre-addresses in this channel (but may be helpful for other channels). 

The average times the RAM is accessed in the training mode will also 
affect the BER performance, the conclusion about how many access times will 
be proper is not included in this thesis, however, comparison for PERC(7,2) 
with different RAM access times is shown in Figure 4-6 for reference: 
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Figure 4-6 RAM access times comparison 



CHAPTER 5 HARDWARE DATA COLLECTION AND 


SIMULATION 

5. 1 Hardware setup description and data collection 

Hardware is setup to collect the data from both ends of the TWTA hardware 
channel. The parameters for the software simulation in Chapter 4 and the parameters 
for hardware setup described in this Chapter are set as close as possible to each other 
for comparison. 

The hardware system is mostly composed of the Hewlett-Packard (HP) 
equipment and Mini-circuit components. A traveling wave tube amplifier (TWTA 
801 OH) is included which contributes for the nonlinearity. A pre-filter and a post- 
filter are connected to TWTA, which causes 1ST The TWTA works in 5-band (1550- 
5200MHz), the pre-filter and post-filter should be passband filters centered at the 
frequency of the RF carrier, however, it is hard to find bandpass filters at this 
frequency range, so instead of using bandpass filters for pre- and post- filtering, 
lowpass filters are used at baseband, pre-filtering is done before up-converting the 
signal to 5-band, and post-filtering is done after down-converting the signal to 
baseband. The general idea of the hardware setup is shown in the Figure 5-1 . 




Figure 5-1 hardware setup diagram 

The modulator is implemented by HP 8782B Vector Signal Generator, the 
Pseudo Random Bit Sequence (PRBS) can be generated by the equipment internally, 
the bit rate is 10Mbps. HP 8782B will implement the 16-QAM modulation, and 
provide the I, Q baseband sequence output. The I, Q symbol rates are 2.5M symbols 
per second. The HP 8981 A Vector Modulation Analyzer works as a demodulator, it 
will extract the I, Q baseband sequence from the received waveform. The Mini- 
Circuit components BLP-1.9 are used as filters, with a 3dB cutoff frequency of 
2.5MHz, the symbol rate. The RF carrier is 2GHz sinusoidal waveform. The data 
collection is done by the LeCROY digital oscilloscope 9384. Figure 5-2 shows the 
hardware connection diagram: 
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Figure 5-2 hardware connection diagram 


The output of TWTA is connected to a coaxial directional coupler (Narda 
Corp. , Model: 3003-10 with a serial number 21540) and then to the load (N 9525 
ARRA load with serial number of 858) to make the TWTA work properly. Due to 
the linear working range of the frequency mixer, the signal after up-converting can 
not drive the TWTA into the saturation region, a combination of Solid State Power 
Amplifier (SSPA ZHL-42) and 20dB attenuator is used to pre-amplify the signal so 
that the signal will drive the TWTA in saturation region. To make the down- 
converter work in linear area, a 50dB attenuator is applied to attenuate the output of 
the TWTA. The working range of all the components in this circuit are carefully 
studied and measured, to guarantee that only the TWTA works in saturation area, 
other devices work in linear area. 
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Notice that the baseband pre- and post- filtering is applied to baseband I and Q 
signals respectively for convenience. The details in the dotted square in Figure 5-2 is 
described in Figure 5-3: 



BLP-1.9: Low pass filter, ldB cutoff 1.9MHz, 3dB cutoff 2. 5MHz 


Figure 5-3 modulator and demodulator connection 

The BLP-1.9 lowpass filters are connected as the external filters of HP 8782B 
and HP 8981 A, and they serve as the pre- and post- filters of the TWTA. Data 
collection is made before the pre-filter and after the post-filter. 

The coherent carrier of the modulator is 100MHz, it is fed to the demodulator 
as reference carrier for demodulation. The function generator provides a 10MHz 
clock signal, it is connected to the external clock of the HP 8782B, which triggers the 
pseudo-random sequence generator inside the HP 8782B Vector signal Generator. 
The 10MHz clock is also connected to the external sampling clock of the 
oscilloscope, which is the clock to record the data, so the sampling clock is four-times 
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the rate of the symbol, and it is synchronized with the symbol. For PERC algorithm, 
the sampling rate only needs to be twice the symbol rate to perform the FSE, the need 
for oversampling is due to the external clock frequency requirement for the 
oscilloscope. 

The data collected at the oscilloscope is the I, Q baseband signals at the 
transmitter and the receiver side. The sampled data (1M points for each of the four 
data sequence) can be saved as a binary file in a floppy disk mounted on the 
oscilloscope, and can be converted to an ASCII file by a software “wavetran.exe” 
provided by LeCROY company. These are the raw data for the simulation in this 
Chapter. 

The details about the setup of each equipment and the connections are listed in 
Appendix A. 

5.2 Hardware generated data results 

From the data collected by the oscilloscope, the transmitted data generated by 
TIP 8782B is shown in Figure 5-4: 
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Figure 5-4 the transmitted signal (hardware) 

The “noisefree” (without injecting noise) received baseband data extracted by 
HP 8981A is shown in Figure 5-5: 
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Figure 5-5 the received signal (hardware) 

Practical factors need to be considered to perform the schemes described in 
Chapter 3 on the hardware collected data. Regarding Figure 5-4, due to the noise in 
the modulator and the cable, the transmitted data collected by the oscilloscope forms 
clusters around the constellation grids of 16-QAM. The constellation grids are gotten 
by taking the average of the points in each cluster. To remove the clustering, the raw 
data were quantized to their closest constellation grids. Then the quantized data are 
considered as the transmitted data and are fed to the program. 

Notice that there are offsets in both the transmitted data and the received data 
plots, the constellations were moved to the upper-right comer, this is caused by the 
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oscilloscope, which is the data-collecting device. The overall vertical accuracy of the 
oscilloscope mentioned by the manual is lOmv, the inaccuracy causes a DC offset 
when recording data. Before performing other receiver algorithms, the offset is 
subtracted from all data points for both sent and received data. The offset is obtained 
by averaging all the data points. After removing the offset, the center of the data 
points is on the origin point, and the constellation is symmetric about origin. 

Another practical factor is the phase offset in the demodulated signal, it is 
caused by the delay in the hardware, and its effect is a rotation on the demodulated 
baseband received data. From hardware measurement on the demodulator, the 
rotation angle is in/8. After subtracting the offset from the received data, the rotation 
is centered at origin, so it carries on a linear transformation on the received data, the 
linear equalizer FSE can cancel the rotation effect, experiment also verified this. The 
length of FSE and the delay is decided by the method in Chapter 3. The length is 
chosen such that if continue to increase the length, the MSE will not decrease, and the 
delay is the one with minimum MSE. The final choice of FSE length is 20 taps for 
both even and odd subequalizers, and the delay is 12. Figure 5-6 shows the value of 
the taps. 
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Figure 5-6 Taps of FSE 

In the noisefree case, the output of FSE is given in Figure 5-7: 
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Figure 5-7 FSE output of real data simulation 

Comparing with Figure 4-3, it will be interesting to notice that the hardware 
channel has caused some asymmetry, the reason and the compensation method for the 
asymmetry is not discussed in this thesis. 

To study the performance in a noised environment, discrete time noise is 
injected in a software way, i.e., adding the discrete noise component in program 
instead of injecting noise in hardware. The same processes of training and fixed- 
mode are performed for the hardware collected data. The BER versus Eb/NO 
performance for different PERC schemes is shown in Figure 5-8: 
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Figure 5-8 BER versus Eb/NO for real-data 

For this real channel, similar results can be achieved as in Chapter 4, by 
including one pre-address, the BER versus Eb/NO performance is substantially 
increased, increasing post-address lines is helpful, but more than 1 pre-address lines is 
not efficient. 

Refer to the results in Figure 5-8, PERC(7,J) and PERC(2,2) which have 
larger RAM sizes, do not show better BER performance as expected. This is because 
of the insufficient RAM access times. Due to the floppy disk size, a maximum of 1M 
data points, which is 250,000 symbols can be involved. With these data, the average 
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RAM access times is 250,000/1 6 4 = 3.8 for PERC(/,5) and PERC(2,2). Especially in 
low Eb/NO case, the noise can not be fully averaged by access RAM so few times. 

To compare the software-generated-data simulation and the real-data 
simulation, PERC(7,2) of them is shown in Figure 5-9: 



Figure 5-9 PERC(7,2) for software generated-data and real-data simulation 

To make a fair comparison, the RAM should have the same average access 
times, the real-data collection includes 250,000 symbols, for PERC(/,2,), each RAM 
address is accessed about 60 times, so the generated-data simulation with 60 average 
access times are used here for comparison. Generally, the real-data simulation shows 
better BER performance than the software generated data simulation, which suggests 
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that the nonlinearity and the ISI in the hardware is over-modeled by the model we 
assumed in Chapter 2. In high Eb/NO case, the simulation on the collected data shows 
a higher error-rate than the generated-data simulation, this is because of the inherent 
noise in the hardware equipment, cable and connectors. In high Eb/NO case, the noise 
in the hardware is comparable to the injected noise, they are both contributing to the 
overall noise, which is the reason why the BER versus Eb/NO performance is worse 
than the software generated data simulation with the same injected noise. When 
injecting higher power noise, the injected noise is dominating, and the hardware noise 
can be neglected. 
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CHAPTER 6 CONCLUSIONS AND RECOMMENDATIONS 


This thesis presents a technique to transmit high order modulated (16-QAM) 
symbols through nonlinear memory channel. A RAM-based equalization algorithm 
that operates on the receiver — the Pre-cursor Enhanced RAM-DFE Canceler (PERC) 
algorithm, is described and tested. The algorithm is based on the theory of Volterra 
model of the nonlinear system and the Finite State Machine (FSM) model of the 
system. 

Compared with the algorithm based on the transmitter, the receiver-based 
methods prevent enhancing the transmitter complexity. Among the receiver-based 
schemes, the RAM-based algorithms are robust to the additive noise in the channel. 
The contribution of the PERC algorithm is that it includes pre-addresses to access the 
RAM and suggests a new way to decide the pre-address lines, and the state of the 
FSM model is more accurately defined. Compared with the results provided in [9] 
PERC provides better performance than the Volterra equalizers. 

The PERC algorithm will compensate for both the warping and the clustering 
effects due to the High Power Amplifier (HP A) and the filters of the channel. PERC 
has lower error rate than the linear equalization schemes and algorithms that only 
involve post-cursors as address lines. For the channel model assumed in this thesis, 
involving one pre-address line (the current symbol) increases the performance 
greatly, while further increasing the pre-address lines costs more calculation, but does 
not leads to much improvement in performance. This is because the model we 
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assumed for the satellite channel has short memory. It may be necessary to include 
more pre-addresses for channels with longer memory. Only addition calculation 
(except the averaging operation in the training period) is needed to implement the 
PERC algorithm, it saves hardware, and will be suitable for real-time system. 

The disadvantage of the algorithm is that for channels with long memory or 
for higher modulation schemes, the RAM size will be huge, and to increase the 
address lines by 1, it is needed to increase the size of RAM 16 times for 16-QAM, 
and even more if higher order modulation is considered. There will be error 
propagation for the PERC algorithm because a current wrong decision will be used as 
future post-address lines. 

The work of this thesis is composed of software and hardware part. In the 
software simulations, the channel is modeled by a lowpass equivalent system. 
Comparisons for different PERC schemes are provided. Hardware is setup to collect 
real-world data, and practical factors are considered for the real-data simulation, 
satisfying results are achieved. 

Some recommendations for future study includes: 

• Verify the algorithm using data from actual TDRSS. 

• Consider adjusting the FSE or RAM parameters in data transmission 
period for nonstationary channels. 

• Study on a method for choosing n and m for different channel models. 

• Study the proper length the training sequence. 
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• Study on a method to replace the FSE, and this method can decide the 
delay of the channel without amplifying the noise. 
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APPENDIX A. HARDWARE SETUP AND CONFIGURATION 


Setup and connections for the Modulator (HP 8782B Vector signal Generator): 
Front panel : 

1 . The Modulation on 

2. The RF output on 

3. The PRBS on 

4. Set Modulation to 1 6 QAM 

5. Set the frequency to 1 00 MHz 

6. Set the level as desired (typically, -3dBm to observe the nonlinear distortion) 

7. EXT FILT on 

8. PRBSCLKon. 

9. The SERIAL/PRBS is connected to the external clock for the PRBS clock. 

10. The RF OUTPUT is connected to “I” of Mixer ZFM 4212. 

Rear panel: 

1. Use the jumper cable to connect the 10MHz time base IN and OUT 
connectors together 

2. INT/EXT Time Base be set to INT. 

3. COHERENT CARRIER is connected to COHERENT CARRIER of HP 
8981 A at the rear panel. 
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4. BLP-1.9 filter is connected between EXT FILT IN and OUT of I and Q 


respectively. 

5. Signal at OUT of I is connected to the channel 1 of LeCroy Oscilloscope 
9384. 

6. Signal at OUT of Q is connected to the channel 2 of LeCroy Oscilloscope 
9384. 


Setup and connections for the Demodulator (HP 8981A Vector Modulation 
Analyzer): 

Front Panel: 

1 . Set Demodulation to 1 6 QAM 

2. Set REF FREQ to 100MHz 

3. Press DEMOD key, press MORE softkey, set DEMOD INT/EXT to INT. 

4. Press DEMOD key, press MORE softkey, set EXT FILTERS ON/OFF to ON. 

5. Press DEMOD key, press MORE softkey, set RF ON/OFF to ON. 

6. Choose display mode by pressing CONSTL. 

Rear Panel: 

1 . RF IN is connected to “I” of Mixer ZFM- 1 50. 

2. COHERENT CARRIER is connected to COHERENT CARRIER of HP 
8782B at the rear panel. 
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3. BLP-1.9 filter is connected between EXT FILT IN and OUT of I and Q 
respectively. 

4. Signal at IN of I is connected to the channel 3 of LeCroy Oscilloscope 9384. 

5. Signal at IN of Q is connected to the channel 4 of LeCroy Oscilloscope 9384. 

Setup and connections for LeCroy Digital Oscilloscope 9384: 

1 . CHI is connected to I, OUT of the HP 8782 B. 

2. CH2 is connected to Q, OUT of the HP 8782 B. 

3. CHS is connected to I, OUT of the HP 8981 A. 

4. CH4 is connected to Q, OUT of the HP 8981 A. 

5. EXT is connected to the external sampling clock. 

6. Set the COUPLING of each channel to DC 50ohm. 

7. Press TIMEBASE SETUP, set the sample clock to “Ov”, external coupling 
DCIMohm, set sequence OFF, turn the knob, set the record points to “1M” 

8. Insert floppy to the top of Lecroy 9384. 

9. Press STOP to capture 1M points of each channel, press STORE, select “TO 
FLOPPY”, set format to “BINARY”, press “DO STORE” 

Setup for the signal generator (HP 8657) : 

1 . The RF output on 

2. Set the frequency to 2000MHz 

3. Set the amplitude to 12dBm. 


64 



Setup for the Traveling Wave Tube Amplifier (TWTA) 8010H: 


1 . Output of TWTA is connected to a coaxial directional coupler (Narda Corp. , 
Model: 3003-10 with a serial number 21540) and then to the load (N 9525 
ARRA load with serial number of 858). The other output of the coupler, 
which has -lOdB attenuation is connected to later circuit. 

2. Turn on the power, (do not turn on the operatoer switch at this time.)warm up 
for about 1 0 minutes. 

3. Turn on the operator switch. 

4. Do not let TWTA work for more than 15 minutes continuously. Turn off the 
operation switch for 10 minutes, (do not need to turn off the power) then turn 
on and start operation again. 
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APPENDIX B. CHARACTERISTICS OF TWTA AND SSPA 


power curve of TWTA and SSPA 
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APPENDIX C 


CHARACTERISTICS OF THE FILTER 



BLP^SOO 
I BlP-600-75 
BLP-750 
BLP-SOO 

BLP-8SO 
I BLP-6S0-75 

eip-iooo 

BLP-1200 


32.95 
33 95 
3295 

32.95 

32-95 

33.95 
3295 

32.95 


NOTES: 

*■ ldB compression at +13 dBm Input power 

■ Denotes 75 ohm model, for coax connector models 75 ohm 
BNC connectors are standard. 

A. General Quality Control Procedures. Environmental 
Specifications, Hi-Rel MIL and TX description are given In 
General Information (section 0). 

B. Connector types and case mounted options, case finishes 
are given In section 0. see ’Case styles & outline drawings'. 

C. Prices and specifications subject to change without notice 

1. Absolute maximum power, voltage and current rating: 
la. RF power, 0,5 Watt 

2. Models are available with maie/female coax connectors, 
for other configurations and inter-series versions consult 
factory. See section 0. case styles and outline drawings 


NSN GUIDE 
MCL NO. NSN 

SLP-30 5915-01-327-4692 

SLP-21.4 5915-01-414-9165 


! CIV PASS 

TYPICAL TRCQUCnCY RESPONSE 



INTERNET http://wwwminicircuits.com 

I I ■ C U I Iw P.O. Box 3501 66, Brooklyn, New York 11235-0003 (718) 934-4500 Fax (71 8} 332-4661 

Distribution Centers t NORTH AMERICA 800-654-7949 417-335-5935 Fax 417-335-5945 EUROPE 44-1252-635094 Fax 44-1 252-837010 
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CflOUP OCtA* (KMC) oe»uw toss <<*) IHSOUKW LOSS <4S) 


rugged, high selectivity 

Low Pass Filters 


LP-1.9 DC-1 .9 MHz 



PJ-\ ? 

PETUflN LOSS 



MODEL SELECTION 

(choose prefix) 

P = plug in N = typeN 
B = BNC S = SMA 


Fieq. 

!. Loss 

R.L 

Freq. 

G . Delay 


X 

o 

X 


X 

(MHz) 

«*> 

(cfB) 

( dB ) 

(MHz) 

(rwec) 

0.1 

0.09 

0.01 

40.00 

1.0 

390 937 

1.2 

0.24 

0.02 

22.11 

1.0 

393.923 

1.4 

0.26 

0.01 

23.88 

1.1 

397.209 

1.6 

0.31 

0.01 

26.19 

1.1 

400.265 

1.8 

0.40 

0.02 

21.19 

1.2 

406.342 

1.9 

0.45 

0.02 

21.37 

1.3 

410-760 

2.1 

0.75 

0.12 

15.39 

1.3 

417.765 

2.8 

2.93 

0.75 

5.05 

1.4 

425 387 

2.5 

B .32 

1.21 

1.43 

1.4 

434.334 

27 

14.69 

1.17 

0.59 

1.5 

444.369 

28 

17.71 

1.12 

0.45 

1.6 

455.268 

3.0 

2330 

1.01 

0.32 

1.6 

468.790 

3.1 

25.89 

0.96 

0.29 

1 7 

484.497 

3.3 

30. 6 B 

0.90 

0.24 

1.8 

506.645 

3.4 

32.92 

0.87 

022 

1.9 

554.145 

3.6 

37.12 

0.84 

0.20 

1.9 

579.783 

3.8 

41.00 

0.81 

0.18 

2.0 

642.409 

4.0 

44 61 

0.79 

017 

2.1 

720.762 

4 3 

49.67 

0.81 

0 15 

2.2 

779.406 

4.5 

52.78 

0.60 

0.14 

2.3 

768.865 

4.7 

5574 

088 

0,13 

2.4 

675.136 

5.0 

60.00 

086 

0.11 

2.5 

579.639 

29.4 

92.50 

4,17 

0.17 

2 7 

390.341 

S 3.8 

91.19 

2.63 

0.42 

2.3 

286.216 

78.1 

92.39 

2.10 

0.45 

3,0 

23 C .561 

102.5 

87.80 

2.75 

0.46 

3.1 

182.807 

126.9 

89.65 

3.87 

0,47 

3.3 

153.959 

151 .3 

87.77 

2.60 

046 

3.4 

132.224 

175.6 

92.87 

7.14 

0.44 

3.6 

109.190 

200.0 

86.82 

4.04 

0.44 

3.8 

92.795 
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APPENDIX D. SIMULATION SOURCE CODE 


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 
%sperc!2 .m 
% PERC { 1 / 2 ) 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 


qam!6; 

cluster=[3. 96e-3 6.28e-3 0.00995 0.01577 0.025 0.03962] ; 
for index=l : 6 

v=randn {length (x3) , 1 ) +j *randn ( length (x3) , 1) ; 

v=v/ sqrt ( v T * v/ length ( v ) ) *sqrt (cluster (index) *x3 T *x3/length (x3) ) 

yl=x3+v; 

clear v; 

sf setrain; 
sramtrainl2 ; 

v=randn (length (x3) , 1) +j*randn (length (x3) , 1) ; 

v=v/ sqrt ( v * *v/length (v) ) *sqrt (cluster (index) *x3 1 *x3 /length (x3) ) 

ylr=x3+v; 

clear v; 

Lyr=length { ylr ) ; 

yer=conv (conj (we) , ylr ( 1 : 2 : Lyr ) ) ; 
yor=conv (conj (wo) , ylr (2 : 2 :Lyr) ) ; 

yr=yer ( 1+delay : length (s) ) +yor ( 1+delay : length (s) ) ; 

clear yer; clear yor; 
figure 

plot (yr, f .'); 

title { T fixed-mode, after fse T ); 

zr= zeros ( length (yr ) , 1 ) ; 
z_test=zeros (16, 1) ; 
error= zeros (16,1); 
sr=zeros (length (yr ) , 1 ) ; 

sr(l:2)-s(l:2); 
zr (l:2)*s (1:2) ; 

for n=3 : length (yr) 

s2r=f ind { ( sr (n-2 ) '‘'ones (16, 1 ) -Q) “0) -1 ; 
slr=find ( (sr (n-1) *ones (16, 1) -Q) ==0) -1; 

for i=l : 16 

address=s2r-i-slr*16 A l+ (i-l)*16 A 2 + l; 
z_test (i ) =yr (n) +ram (address ) ; 
if ( i==l ) best=l;end; 
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error (i)-abs (zjtest (i)-Q(i) ) ; 
if (error (i) <error (best) ) best=i;end; 
end; % test 16 defferent addresses 
zr (n) =yr (n) + ram ( s2r+slr*16 A l+ (best-1 ) *16 A 2+1) 
sr (n) =Q (best) ; 
end; % all sequence 

figure 
plot (zr, 

title (' fixed-mode, perc'); 
cc=0 ; 

for i=l : length ( zr ) 

if (s (i) —sr (i) ) cc=cc+l; end; 

end; 

error_rate= ( length (zr) -cc) /length (zr) 
end; 


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

Iqaml 6 .m 

%generating 16-QAM random sequence 
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

clear 

rand ( ' seed’ , 0) ; 
j =sqrt (-1 ) ; 
index=l ; 

Q=zeros (16,1); 
for i=3 : -2 : -3 
for k=-3 : 2 : 3 
Q (index) =k+j *i; 
index=index+l ; 
end; 
end; 

Q=Q/ (sum (abs (Q) ) /16) ; 

number=le6 ; 
sample=4 ; 

xl^zeros (number '‘'sample, 1) ; 

for i=l : sample : number*sample 

xl (i) =Q (rem ( floor (rand (1,1)*16),16)+1); 

xl (i+1 : i + sample-1 ) =ones (sample-1, 1) *xl (i) ; 

end; 

[b, a] =butter (6,0.5); 
x2=f ilter (b, a, xl) ; 
s=xl (1 : sample : length (xl) ) ; 
clear xl; 


ro=abs (x2 ) ; 
theta=angle (x2) ; 
clear x2; 
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A=1 . 9 638 *ro . / (1 + 0. 9 94 5 *ro . A 2 ) ; 
phi=theta+2 . 5293*ro . A 2 . / ( 1 + 2 . 81 68*ro . A 2) ; 

clear ro; 
clear theta; 
x2twt=A. *exp ( j *phi) ; 

clear A; 
clear phi; 
clear x2; 

x33=f liter (b, a, x2twt ) ; 
x3=x33 (1 :sample/2: length (x33) ) ; 

clear x2twt; 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

%sf setrain .m 
%FSE training 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 


L=1 5000 ; 

Ly=length (yl ) ; 
ue=yl (1:2: Ly) ; 
uo=yl (2:2: Ly) ; 
clear yl; 

M=5 ; 

delay=ceil (M/2) +1; 

w0e= zeros (M, 1) ;w0o= zeros (M, 1 ) ; 

[e, we, wo] =nlms_fse ( wOe , wOo, ue ( 1 : L) ,uo (1:L) , [zeros (delay, 1) ;s (1:L- 
delay) ] , 0 . 001, 0) ; 

MSE=abs ( e . A 2 ) ; 
clear e; 
figure 

plot (10*logl0 (MSE) ) ; 

J=sum (MSE ( length (MSE) -49: length (MSE) ) ) /50 

ye=conv ( con j ( we ) , ue ) ; 
yo=conv (conj (wo) ,uo) ; 

y=ye (1+delay: length (ue) ) +yo ( 1+delay: length (ue) ) ; 
figure 

plot (y, f . 1 ) ; 

title ( ' training, the signal after FSE, before distortion 1 ); 

clear ue; 
clear uo; 
clear ye; 
clear yo; 
clear yl; 
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%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 
%sramtrain!2 .m 
%RAM training for PERC(1,2) 
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 


ram=zeros ( 1 6 A 3, 1 ) ; 
ram_count=zeros ( 16 A 3, 1 ) ; 
z=zeros (length (y) , 1) ; 

for n=3 : 15*16 A 3 

s2=f ind ( (s (n-2) *ones (16, 1)-Q)*=0) -1; 
s l=find ( (s (n-1 ) *ones (16, 1) -Q) — 0 ) -1; 
sO=find( (s (n) *ones (16, 1) -Q) ==0 ) -1; 
address=s2+sl*16 A l+sO*16 A 2+l; 
ram (address) =ram (address ) + (s (n) -y (n) ) ; 
ram_count (address) =ram_count (address) + 1; 
end; 

for address-1 : 16 A 3 
if ram_count (address ) -=0 

ram(address) =ram (address) /ram_count (address) ;end; 
end; 


%%%%%%%%%%%%%%%%%%%%%%%«%%%%%«%%% 


%sperc22 .m 
%PERC (2,2) 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 


qaml6; 

cluster^ [3 . 96e-3 6.28e-3 0.00995 0.01577 0.025 0.03962] ; 
for index=l:6 

v=randn (length (x3) , 1 ) +j *randn ( length (x3 ) , 1) ; 

v=v/ sqrt (v f *v/length (v) ) *sqrt (cluster (index) *x3 1 *x3 /length (x3 ) ) 

yl=x3+v; 

clear v; 

sf setrain; 
clear yl; 
sramtrain22 ; 

v=randn (length (x3) , 1) +j*randn (length (x3) , 1) ; 

v=v/ sqrt ( v T * v/ length ( v ) ) *sqrt (cluster (index) *x3 T *x3/length (x3) ) 

ylr=x3+v; 

clear v; 

Lyr^length (ylr) ; 

yer=conv (conj (we) , ylr (1:2: Lyr ) ) ; 
yor=conv ( con j (wo) , ylr (2:2: Lyr) ) ; 
clear ylr; 
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yr-yer ( 1+delay : length (s ) ) +yor ( 1+delay : length (s ) ) ; 


clear yer; clear yor; 
figure 

plot (yr, T . 1 ) ; 

title ( T fixed-mode, after fse'); 

zr=zeros (length (yr ) , 1 ) ; 
z_test=zeros (16, 16) ; 
error=zeros (16, 16) ; 
sr-zeros (length (yr ) , 1 ) ; 

sr (l:2)=s (1:2) ; 
zr (1 : 2) =s (1 : 2) ; 

for n=3 : length (yr) 

s2r=f ind ( (sr (n-2) *ones (16,1) -Q) ==0 ) -1; 
slr=f ind ( (sr (n-1) *ones (16, 1) -Q) ==0) -1; 


for i=l:16 
for k= 1 : 1 6 

address=s2r+slr*16 A l+ (k-l)*16 A 2+(i-l)*16 A 3+l; 
z_test (i , k) =yr (n) +r am (address ) ; 

if (i==l & k==l ) besti=l; bestk=l; end; 

error (i, k) =abs (z_test (i, k) -Q (k) ) ; 
if (error (i, k) <error (besti, bestk) ) 
besti=i; bestk=k; 
end; 

end; % test 16 defferent addresses 
end; 

zr (n) =yr (n) tram (s2r+slr* 16 A 1+ (bestk-1) *16 A 2+ (besti-1) *16 A 3+1) 
sr (n) =Q (bestk) ; 
end; % all sequence 
figure 

plot (zr, f . ' ) ; 

title ( 1 fixed-mode, perc 1 ); 
cc=0 ; 

for i=l : length (zr) 

if (s(i)“sr(i)) cc=cc+l; end; 

end; 

error_rate= (length (zr) -cc) /length(zr) 


end; 
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%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 
%sramtrain22 .m 
%RAM training for PERC(2,2) 
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 


ram=zeros (16 A 4, 1) ; 
ram_count = zeros ( 16 A 4 , 1 ) ; 
z= zeros (length (y) , 1 ) ; 

for n=3:15*16 A 4 

s2=f ind ( (s (n-2) *ones (16, 1) -Q) ==0) -1; 
sl=find( (s (n-1) *ones (16, 1) -Q)==0) -1; 
sO=f ind ( (s (n) *ones (16,1) -Q) ==0) -1; 
s01=f ind ( (s (n+1) *ones (16, 1) -Q) ==0) -1; 
address=s2+s 1*1 6 A l+s0*16 A 2+s01*16 A 3+l ; 

ram (address) =ram (address) + (s (n) -y (n) ) ; 
ram_count (address ) =ram_count (address) +1; 

end; 

for address=l : 1 6 A 4 
if ram_count (address ) -=0 

ram (address) =ram (address) /ram_count (address) ;end; 
end; 


%%%%%%%%%%%«%%%%%%%%%%%%%%%%%%%% 


%perct 12 . m 

%real-data simulation PERC(1,2) 
%%%%%%%%%%«%%%%%%%%%%%%%%%%%%%%% 


f setrain; 

cluster=[3. 96e-3 6.28e-3 0.00995 0.01577 0.025 0.03962] ; 
for index=l : 6 


rand ( 1 seed’ , 0) ; 

v=randn (length (x2) , 1 ) +j *randn (length (x2) , 1) ; 

v-v/sqrt (v' *v/length(v) ) *sqrt ( (cluster (index) ) *x2 1 *x2 /length (x2 ) ) 

yl=x2+v; 

clear v; 

ue=yl (1:2: length (yl) ) ; 
uo=yl (2:2: length (yl) ) ; 

L=15000 ; 

M=20; 
delay=12 ; 

w0e=zeros (M, 1) ;w0o=zeros (M, 1) ; 

[e, we, wo] =nlms_fse (wOe, wOo, ue ( 1 : L) , uo (1:L) , [zeros (1, delay) s (1 :L- 
delay) ] , 0.01, 0) ; 
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MSE=abs (e. A 2) ; 

figure 

plot (MSE) ; 

J=sum(MSE (length (MSE) -49: length (MSE) ) ) /50 
Lr=length (ue) ; 

ye=zeros (length (we) +length (ue) -1, 1) ; 
yo=zeros (length (wo) +length (uo) -1, 1) ; 
ye=conv (conj (we) , ue) ; 
yo=conv(conj (wo) ,uo) ; 

y=ye (1+delay: length (s) ) +yo ( 1+delay: length (s ) ) ; 


ramtrainl2t ; 

% fixed mode 

v=randn (length (x2 ) , 1 ) +j +randn ( length (x2) , 1) ; 

v®v/ sqrt (v f *v/length ( v) ) *sqrt ( (cluster (index) ) *x2 T *x2 /length (x2 ) ) 

ylr=x2+v; 

clear v; 

Lyr=length (ylr) ; 

yer=conv (con j (we) , ylr (1 :2:Lyr) ) ; 
yor=conv (conj (wo) , ylr (2 : 2 : Lyr ) ) ; 
yr=yer ( 1+delay : length ( s ) ) +yor (1+delay : length ( s ) ) ; 

zr= zeros ( length (yr) , 1) ; 
z_test=zeros (16, 1) ; 
error=zeros (16, 1) ; 
sr=zeros (length (yr) , 1) ; 

sr (l:2)=s (1:2) ; 
zr (1 : 2)=s (1 : 2) ; 

for n=3 : length (yr) 

s2r=find ( (sr (n-2) *ones ( 16, 1 ) -Q) ==0) -1; 
slr=f ind ( (sr (n-1) *ones (16,1) -Q) ==0) -1; 

for i-1 : 16 

address=s2r+slr*16 A 1+ (i-1) *16 A 2+1; 
z_test (i) =yr (n) +ram (address ) ; 
if (i==l) best=l;end; 

error (i)=abs (z_test (i)-Q(i) ) ; 
if (error (i) <error (best) ) best=i; end; % if 
end; % test 16 defferent addresses 
zr (n) =yr (n) +ram (s2r+slr*16 A l+ (best-1 ) *16 A 2+1 ) ; 
sr (n) =Q (best) ; 
end; % all sequence 


figure 

plot (zr, r . 1 ) ; 

title ( T fixed-mode, perc f ) ; 
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cc=0; 

for i=l : length ( zr ) 

if (s (i)==sr (i) ) cc=cc+l; end; 

end; 

error_rate= { length { zr ) -cc) /length (zr) 


%%%%%%%%%%%%%%%%%%%%%%%%%% 

%f setrain . m 
%Real-data simulation 
%load data and FSE training 
%%%%%%%%%%%%%%%%%%%%%%%%%% 

clear 

% load training sequence, sent and received. 

load . /data/ s cl . dat 

load . /data/sc2 . dat 

load . /data/sc3 . dat 

load . /data/sc4 . dat 

L=length ( scl ) ; 
xl=scl (1:2:L) + j *sc2 ( 1 : 2 : L) ; 
clear scl; 
clear sc2; 

Lx=length (xl ) ; 
x=xl ( 1 : 2 : Lx) ; 

x2=sc4 { 1 : 2 : L) + j *sc3 ( 1 : 2 : L) ; 
clear sc3; 
clear sc4; 

constellation; 

for i=l : length (x) 

s (i ) ^quantize (x (i) ,Q) ; 

end; 

center_Q=sum ( Q) /length (Q) ; 

Q=Q-center_Q; 

s-s-center_Q; 

cent er_y=sum ( x2 ) /length ( x2 ) ; 
x2=x2-center_y; 

rand ( 1 seed ! , 0 ) ; 

v=randn (length (x2) , 1 ) +j *randn ( length (x2 ) , 1) ; 

v=v/ sqrt (v T *v/length (v) ) *sqrt ( (cluster ) *x2 f *x2/length (x2 ) ) 

yl=x2+v; 

clear v; 

ue=yl (1:2: length (yl) ) ; 
uo=yl (2:2: length (yl) ) ; 

L=15000; 

M=20 ; 
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delay=12; 

wOe=zeros (M, 1) ;wOo=zeros (M, 1) ; 

[e, we, wo]=nlms_fse (wOe, wOo, ue ( 1 : L) ,uo ( 1 : L) , [zeros (1, delay) s (1:L- 
delay) ] , 0 . 01, 0) ; 

MSE=abs (e . A 2) ; 

figure 

plot (MSE) ; 

J=sum(MSE (length (MSE) -4 9: length (MSE) ) ) /50 
Lr=length (ue) ; 

ye=zeros (length (we) +length (ue) -1, 1) ; 
yo=zeros (length (wo) tlength (uo) -1, 1) ; 
ye=conv ( con j ( we ) , ue ) ; 
yo=conv ( con j ( wo ) , uo ) ; 

y=ye (1+delay: length (s) ) +yo (1+delay: length (s) ) ; 
figure 

plot (y, 1 . f ) ; 

title { T training, the signal after FSE, before distortion’); 


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 


%ramtrainl2t . m 

%real-data simulation RAM training for PERC(1,2) 
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%«%%%%%% 


ram= zeros ( 16 A 3, 1 ) ; 
ram_count = zeros (16 A 3, 1) ; 
z=zeros (length(y) , 1) ; 

for n=3 : length (y) 

s2=f ind ( (s (n-2) *ones (16, 1) -Q)==0) -1; 
s l=f ind ( (s (n-1) *ones (16, 1)-Q)“0) -1; 
s0=f ind ( ( s (n) *ones (16,1) -Q) ==0) -1; 
address=s2+sl*l 6 A l+s0^16 A 2+l; 
ram (address ) =ram (address ) + ( s (n) -y (n) ) ; 
ram_count (address) =ram_count (address) +1; 

end; 


for address=l : 1 6 A 3 
if ram__count (address) -=0 

ram (address) =ram (address ) . /ram_count (address) ;end; 
end; 


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 
% cons tell at ion .m 
% Constellation 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 
QAM=zeros (16, 1) ; 
count=zeros (16, 1) ; 
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for i=l : length (x) 

if ( (real (x(i) )<=-0.02) & ( imag (x (i ) ) >0 . 035 ) ) QAM ( 1 ) =QAM ( 1 ) +x (i ) 

count (1 ) =count ( 1 ) +1 ; end; 

if ( (“0 . 02<real (x (i) ) ) & (real (x (i) ) <=0 . 005 ) & ( imag (x ( i ) )>0. 035) ) 

QAM ( 2 ) =QAM ( 2 ) + x ( i ) ; 

count (2) =count (2) +l;end; 

if ( (0 . 005<real (x (i) ) ) & (real (x (i) ) <=0 . 035) & (imag (x(i)) >0.035)) 

QAM ( 3 ) =QAM ( 3 ) +x ( i ) ; 

count ( 3 ) =count ( 3 ) + 1 ; end ; 

if ( (real (x (i) ) >0 . 035) & (imag (x (i) ) >0.035) ) QAM ( 4 ) =QAM ( 4 ) +x (i) ; 

count { 4 ) =count ( 4 ) + 1 ; end ; 

if ( ( real (x { i ) ) <=-0 . 02 ) & ( Ocimag (x (i ) ) ) & (imag (x (i ) ) <=0 .035) ) 

QAM ( 5 ) -QAM ( 5 ) +x ( i ) ; 

count (5) =count ( 5 ) +1; end; 

if ( (“0 . 02<real (x (i) ) ) & (real (x (i ) ) <=0 . 005) & (0 . 005<imag (x (i) ) ) & 

{ imag (x (i ) )<=0.035)) QAM ( 6 ) =QAM ( 6) +x ( i ) ; 

count (6) =count ( 6) +1 ; end; 

if ( (0 . 005<real (x (i) ) ) & (real (x (i) ) <=0 . 035) & (0 . 005<imag (x (i) ) ) & 
(imag (x (i) ) <=0 . 035) ) QAM (7 ) =QAM ( 7 ) *t*x (i) ; 

count (7 ) =count ( 7 ) +1; end; 

if { (real (x ( i) ) >0 . 035 ) & ( 0<imag (x (i ) ) ) & (imag (x (i) ) <=0 . 035) ) 

QAM ( 8 ) =QAM ( 8 ) +x { i ) ; 

count ( 8 ) =count ( 8 ) + 1 ; end ; 

if ( ( real (x ( i ) ) <=-0. 02) & (-0. 02<imag (x (i) ) ) & (imag(x(i) )<=0.005) ) 

QAM ( 9 ) =QAM ( 9 ) +x ( i ) ; 

count ( 9 ) =count ( 9 ) + 1 ; end ; 
if ( (-0. 02<real (x(i) ) ) & (real (x (i ) ) <«0 . 005 ) & (- 
0 . 02<imag (x (i) ) ) & (imag(x (i) ) <=0. 005) ) QAM ( 10 ) =QAM ( 10 ) +x ( i ) ; 

count ( 10 ) =count ( 10 ) +1 ; end; 
if ( (0 . 005<real (x (i) ) ) & ( real (x (i ) ) <=0 . 035 ) & (- 
0 . 02<imag (x (i) ) ) & (imag (x (i) ) <=0 . 005 ) ) QAM ( 11 ) =QAM { 11) +x (i) ; 

count ( 11 ) =count (11 ) +1; end; 

if ( (real (x (i) ) >0 . 035) & (-0 . 02<imag (x (i) ) ) & ( imag (x (i) ) <=0 .005) ) 

QAM ( 12 ) =QAM ( 12 ) +x { i ) ; 

count ( 12 ) =count (12 ) +1 ; end; 

if ( (real (x ( i ) ) <=-0 . 02) & (imag (x (i ) )<=-0.02) ) QAM ( 13 ) =QAM ( 13) +x (i ) ; 

count (13) =count (13) +l;end; 

if ( (-0 . 02<real (x (i) ) ) & (real (x (i) ) <=0 . 005) & (imag (x (i) ) <=-0 . 02) ) 

QAM (14) =QAM (14) +x ( i ) ; 

count (14 ) =count (14 ) +1; end; 

if ( ( 0 . 005<real (x (i) ) ) & (real (x (i) ) <=0 . 035) & (imag (x (i) ) <=-0 . 02) ) 

QAM ( 15 ) =QAM ( 15 ) +x ( i ) ; 

count (15) =count ( 1 5 ) + 1 ; end ; 

if ( (real (x (i) ) >0. 035) & (imag (x (i) )<=-0 . 02) ) QAM (16) =QAM (16) +x (i) ; 

count (16) =count ( 1 6 ) + 1 ; end ; 

end; 

Q=QAM . /count ; 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

%quantize .m 
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%Quantization 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

function d=quantize (c f Q) 

for tt=l : 16 

dist (tt) =abs (c-Q (tt ) ) ; 

end; 

d=Q ( find ( (abs (dist ) -min (abs (dist) ) ) ==0) ) ; 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

%perc22t .m 

%real“data simulation PERC(2,2) 
%%%%%%%%%%%%%%%■%%%%%%%%%%%%%%%%% 


f setrain; 

cluster= [3 . 96e-3 6.28e-3 0.00995 0.01577 0.025 0.03962] ; 
for index=l : 6 
rand ( 1 seed T , 0 ) ; 

v=randn (length (x2) , 1) +j *randn (length (x2 ) , 1) ; 

v=v/ sqrt (v T *v/length (v) ) *sqrt ( (cluster (index) ) *x2 1 *x2 /length (x2) ) 

yl=x2+v; 

clear v; 

ue=yl (1:2: length ( y 1 ) ) ; 
uo=yl (2:2: length (yl) ) ; 

L= 15000; 

M=20 ; 
delay=12 ; 

wOe=zeros (M, 1 ) ; w0o=zeros (M, 1 ) ; 

[e,we,wo] =nlms_f se (wOe, wOo, ue ( 1 : L) ,uo(l:L) , [ zeros ( 1 , delay) s (1:L- 
delay) ] , 0 . 01, 0) ; 

MSE=abs (e. A 2) ; 

figure 

plot (MSE) ; 

J=sum (MSE (length (MSE) -4 9: length (MSE) ) ) /50 
Lr=length (ue) ; 

ye=zeros ( length (we) tlength (ue ) -1 , 1) ; 
yo=zeros (length (wo) ^length (uo) -1, 1 ) ; 
ye=conv(conj (we) , ue) ; 
yo=conv(conj (wo) ,uo) ; 

y=ye ( 1+delay : length (s) ) +yo ( 1+delay: length ( s ) ) ; 
ramtrain22t ; 

% fixed mode 

v=randn (length (x2) , 1) + j *randn ( length (x2) , 1) ; 

v-v/ sqrt ( v T * v/ lengt h (v) ) *sqrt ( (cluster (index) } *x2 1 *x2/ length (x2) ) 
ylr=x2+v; 
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Lyr=length (ylr ) ; 

yer-conv (conj (we) , ylr (1 : 2 : Lyr) ) ; 
yor=conv {conj (wo) , ylr (2:2: Lyr) ) ; 

yr=yer (1+delay : length (s) ) +yor ( 1+del ay : length (s) ) ; 

zr=zeros (length (yr) , 1) ; 
z_test=zeros (16, 16) ; 
error=zeros (16, 16) ; 
sr=zeros (length (yr) , 1) ; 

sr (l:2)=s (1:2) ; 
zr (1 : 2 ) *s ( 1 : 2 ) ; 

for n=3 : length (yr) 

s2r=find ( (sr (n-2) *ones ( 16, 1 ) -Q) ==0 ) -1 ; 
slr=f ind ( (sr (n-1) + ones (16, l)-Q)-=0)-l; 

for i=l : 16 
for k=l : 16 

address=s2r+slr*16 A l+ (k-l)*16 A 2+(i-l)*16 A 3+l; 
z_test (i, k) =yr (n) +ram (address) ; 

if (i==l & k==l ) besti=l; best k=l ; end; 

error (i, k)=abs (z_test (i, k) -Q (k) ) ; 
if (error (i, k) cerror ( best i , best k) ) 
besti=i; bestk=k; 
end; 

end; % test 16 defferent addresses 
end; 

zr (n) =yr (n)+ram(s2r+slr'* r 16 A l+(bestk-l)*16 A 2+(besti-l)*16 A 3+l) 
sr (n) =Q (best k) ; 
end; % all sequence 

figure 

plot (zr, T .M; 

title ( T fixed-mode, perc T ); 
cc= 0 ; 

for i=l : length ( zr) 

if (s ( i ) ==sr (i ) ) cc=cc+l; end; 

end; 

er ror_r a te= (length ( zr ) -cc) /length ( zr) 
end; 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

%ramtrain22t . m 

%real-data simulation RAM training for PERC(2,2) 
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

ram= zeros ( 1 6 A 4 , 1 ) ; 
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ram_count = zeros (16 A 4, 1) ; 
z= zeros (length (y) , 1) ; 

for n=3 : length (y) -1 

s2=find( (s (n-2) *ones (16, 1) -Q) ==0) -1; 
sl=f ind ( (s (n-1) *ones (16, 1) -Q)==0) -1; 
sO=f ind ( (s (n) *ones ( 16, 1 ) -Q) ==0 ) -1 ; 
s01=f ind ( (s (n+1) *ones (16, 1)-Q)==0) -1; 
address=s2+sl*16 A l+sO* 1 6 A 2+s01*16 A 3+l ; 

ram (address) =ram (address ) + (s (n) -y (n) ) ; 
ram_count (address) =ram_count (address) +1; 
end; 

for address=l : 16 A 4 
if ram_count (address ) ^=0 

ram(address)=ram(address) . /ram_count (address) ;end; 
end; 
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